SEQUENCE LISTING



(1) GENERAL INFORMATION: 




(i) APPLICANT: ASHIKARI, Toshihiko
TANAKA, Yoshikazu
FUJIWARA, Hiroyuki
NAKAO, Masahiro
FUKUI, Yuko
SAKAKIBARA, Keiko
MIZUTANI, Masako
KUSUMI, Takaaki

(ii) TITLE OF INVENTION: GENE ENCODING A PROTEIN HAVING ACYL 
GROUP TRANSFER ACTIVITY 



(iii) NUMBER OF SEQUENCES: 31 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BURNS, DOANE, SWECKER & MATHIS, L.L.P.

(B) STREET: 1737 King Street, Suite 500 

(C) CITY: Alexandria 

(D) STATE: Virginia

(E) COUNTRY: United States 

(F) ZIP: 22314-2756 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/894,356

(B) FILING DATE: 18-AUG-1997 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 7-67159 

(B) FILING DATE: 17-FEB-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 7-196915 

(B) FILING DATE: 29-JUN-1995 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 8-46534

(B) FILING DATE: 30-JAN-1996 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: WO PCT/JP96/00348

(B) FILING DATE: 16-FEB-1996 



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Meuth, Donna M.

(B) REGISTRATION NUMBER: 36,607 

(C) REFERENCE/DOCKET NUMBER: 001560-308

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 836-6620 

(B) TELEFAX: (703) 836-2021 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1703 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Gentiana triflora var. japonica
(F) TISSUE TYPE: petal

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: cDNA library

(B) CLONE: pGAT4

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 6..1412

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

TCATT ATG GAG CAA ATC CAA ATG GTG AAG GTT CTT GAA AAA TGC CAA 47
Met Glu Gln Ile Gln Met Val Lys Val Leu Glu Lys Cys Gln
1 5 10

GTT ACA CCA CCA TCT GAC ACA ACA GAT GTC GAG TTA TCG CTA CCG GTA 95
Val Thr Pro Pro Ser Asp Thr Thr Asp Val Glu Leu Ser Leu Pro Val
15 20 25 30

ACA TTC TTC GAT ATC CCC TGG TTG CAC TTG AAT AAG ATG CAG TCC CTT 143
Thr Phe Phe Asp Ile Pro Trp Leu His Leu Asn Lys Met Gln Ser Leu
35 40 45

CTG TTT TAC GAC TTT CCG TAC CCA AGA ACA CAT TTC TTG GAC ACT GTT 191
Leu Phe Tyr Asp Phe Pro Tyr Pro Arg Thr His Phe Leu Asp Thr Val
50 55 60

ATC CCT AAT CTT AAG GCC TCT TTG TCT CTC ACT CTA AAA CAC TAC GTT 239
Ile Pro Asn Leu Lys Ala Ser Leu Ser Leu Thr Leu Lys His Tyr Val
65 70 75

CCG CTT AGC GGA AAT TTG TTG ATG CCG ATC AAA TCG GGC GAA ATG CCG 287
Pro Leu Ser Gly Asn Leu Leu Met Pro Ile Lys Ser Gly Glu Met Pro
80 85 90

AAG TTT CAG TAC TCC CGT GAT GAG GGC GAC TCG ATA ACT TTG ATC GTT 335
Lys Phe Gln Tyr Ser Arg Asp Glu Gly Asp Ser Ile Thr Leu Ile Val
95 100 105 110

GCG GAG TCT GAC CAG GAT TTT GAC TAC CTT AAA GGT CAT CAA CTG GTA 383
Ala Glu Ser Asp Gln Asp Phe Asp Tyr Leu Lys Gly His Gln Leu Val
115 120 125

GAT TCC AAT GAT TTG CAT GGC CTT TTT TAT GTT ATG CCA CGG GTT ATA 431
Asp Ser Asn Asp Leu His Gly Leu Phe Tyr Val Met Pro Arg Val Ile
130 135 140

AGG ACC ATG CAA GAC TAT AAA GTG ATC CCG CTC GTA GCC GTG CAA GTA 479
Arg Thr Met Gln Asp Tyr Lys Val Ile Pro Leu Val Ala Val Gln Val
145 150 155

ACC GTT TTT CCT AAC CGT GGC ATA GCC GTG GCT CTG ACG GCA CAT CAT 527
Thr Val Phe Pro Asn Arg Gly Ile Ala Val Ala Leu Thr Ala His His
160 165 170

TCA ATT GCA GAT GCT AAA AGT TTT GTA ATG TTC ATC AAT GCT TGG GCC 575
Ser Ile Ala Asp Ala Lys Ser Phe Val Met Phe Ile Asn Ala Trp Ala
175 180 185 190

TAT ATT AAC AAA TTT GGG AAA GAC GCG GAC TTG TTG TCC GCG AAT CTT 623
Tyr Ile Asn Lys Phe Gly Lys Asp Ala Asp Leu Leu Ser Ala Asn Leu
195 200 205

CTT CCA TCT TTC GAT AGA TCG ATA ATC AAA GAT CTG TAT GGC CTA GAG 671
Leu Pro Ser Phe Asp Arg Ser Ile Ile Lys Asp Leu Tyr Gly Leu Glu
210 215 220

GAA ACA TTT TGG AAC GAA ATG CAA GAT GTT CTT GAA ATG TTC TCT AGA 719
Glu Thr Phe Trp Asn Glu Met Gln Asp Val Leu Glu Met Phe Ser Arg
225 230 235

TTT GGA AGC AAA CCC CCT CGA TTC AAC AAG GTA CGA GCT ACA TAT GTC 767
Phe Gly Ser Lys Pro Pro Arg Phe Asn Lys Val Arg Ala Thr Tyr Val
240 245 250

CTC TCC CTT GCT GAA ATC CAG AAG CTA AAG AAC AAA GTA CTG AAT CTC 815
Leu Ser Leu Ala Glu Ile Gln Lys Leu Lys Asn Lys Val Leu Asn Leu
255 260 265 270

AGA GGA TCC GAA CCG ACA ATA CGT GTA ACG ACG TTC ACA ATG ACG TGT 863
Arg Gly Ser Glu Pro Thr Ile Arg Val Thr Thr Phe Thr Met Thr Cys
275 280 285

GGA TAC GTA TGG ACA TGC ATG GTC AAA TCA AAA GAT GAC GTC GTA TCA 911
Gly Tyr Val Trp Thr Cys Met Val Lys Ser Lys Asp Asp Val Val Ser
290 295 300

GAG GAA TCA TCG AAC GAC GAA AAT GAG CTC GAG TAC TTC AGT TTT ACA 959
Glu Glu Ser Ser Asn Asp Glu Asn Glu Leu Glu Tyr Phe Ser Phe Thr
305 310 315

GCG GAT TGC CGA GGA CTT CTG ACG CCC CCG TGT CCG CCT AAC TAC TTT 1007
Ala Asp Cys Arg Gly Leu Leu Thr Pro Pro Cys Pro Pro Asn Tyr Phe
320 325 330

GGC AAC TGT CTT GCG TCA TGC GTT GCA AAA GCA ACA CAT AAA GAG TTA 1055
Gly Asn Cys Leu Ala Ser Cys Val Ala Lys Ala Thr His Lys Glu Leu
335 340 345 350

GTT GGG GAT AAA GGG CTT CTT GTT GCA GTT GCA GCT ATT GGA GAA GCC 1103
Val Gly Asp Lys Gly Leu Leu Val Ala Val Ala Ala Ile Gly Glu Ala
355 360 365

ATT GAA AAG AGG TTG CAC AAC GAA AAA GGC GTT CTT GCA GAT GCA AAA 1151
Ile Glu Lys Arg Leu His Asn Glu Lys Gly Val Leu Ala Asp Ala Lys
370 375 380

ACT TGG TTA TCG GAA TCT AAT GGA ATC CCT TCA AAA AGA TTT CTC GGG 1199
Thr Trp Leu Ser Glu Ser Asn Gly Ile Pro Ser Lys Arg Phe Leu Gly
385 390 395

ATT ACC GGA TCG CCT AAG TTC GAT TCG TAT GGT GTA GAT TTT GGA TGG 1247
Ile Thr Gly Ser Pro Lys Phe Asp Ser Tyr Gly Val Asp Phe Gly Trp
400 405 410

GGA AAG CCT GCA AAA TTT GAC ATT ACC TCT GTT GAT TAT GCA GAA TTG 1295
Gly Lys Pro Ala Lys Phe Asp Ile Thr Ser Val Asp Tyr Ala Glu Leu
415 420 425 430

ATT TAT GTG ATT CAG TCC AGG GAT TTT GAA AAA GGT GTG GAG ATT GGA 1343
Ile Tyr Val Ile Gln Ser Arg Asp Phe Glu Lys Gly Val Glu Ile Gly
435 440 445

GTA TCA TTG CCT AAG ATT CAT ATG GAT GCA TTT GCA AAA ATC TTT GAA 1391
Val Ser Leu Pro Lys Ile His Met Asp Ala Phe Ala Lys Ile Phe Glu
450 455 460

GAA GGC TTT TGC TCT TTG TCA TAGTCTCTTT AATAGAACCA TATTTGCTGC 1442
Glu Gly Phe Cys Ser Leu Ser
465

AATAAAGTAC CAAGTCCTTT AGTAACACTA CACCAAACCC TACTTTCGAG GCGGGAACAC 1502

CACAACGAGG TTCAATCACT AGAAGGTTGT ACTTCATAAA TTCCAGAGGT CGAATATACA 1562

CCGTTGTCCT CTGAAAAGTT GAACCTCACA CCTGACATGG TGTTACGATA GGTATTGTAT 1622

AATGCCATTA TATACTTCCA TAAAGTATCC TATGCAATAG AGAACATGTT ATGTGTTAAA 1682

AAAAAAAAAA AAAAAAAAAA A 1703



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1622 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Gentiana triflora var. japonica
(F) TISSUE TYPE: petal

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: cDNA library

(B) CLONE: pGAT106

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 35..1471

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

GAACCATTGA ATCCAATTAA TCTGATTTAT TAAG ATG GCA GGA AAT TCC GAG 52
Met Ala Gly Asn Ser Glu
1 5

GAT ATC AAA GTT CTT GAG AAA TGC CGT GTT GCG CCA CCA CCG GAC GCC 100
Asp Ile Lys Val Leu Glu Lys Cys Arg Val Ala Pro Pro Pro Asp Ala
10 15 20

GTC GCC GAG TTT ACA GTC CCA CTG TCG TTT TTC GAC ATG CGA TGG TTG 148
Val Ala Glu Phe Thr Val Pro Leu Ser Phe Phe Asp Met Arg Trp Leu
25 30 35

ATC TCT GAT GCA GAA CAC CAT CTG CAT TTC TAC AGA TTC CGC CAT CCT 196
Ile Ser Asp Ala Glu His His Leu His Phe Tyr Arg Phe Arg His Pro
40 45 50

TGT CCC AAC TCT AAA TTT ATC ATT TCA TCC ATT AAA TCG TCC CTT TCC 244
Cys Pro Asn Ser Lys Phe Ile Ile Ser Ser Ile Lys Ser Ser Leu Ser
55 60 65 70

CTT GTT CTC AAA CAC TTT CTT CCG TTA GCC GGG AAT TTG ATT TGG CCG 292
Leu Val Leu Lys His Phe Leu Pro Leu Ala Gly Asn Leu Ile Trp Pro
75 80 85

GTA GAT TCC TCC GAT AGA ATG CCG GAG TTG CGT TAC AAG AAA GGG GAC 340
Val Asp Ser Ser Asp Arg Met Pro Glu Leu Arg Tyr Lys Lys Gly Asp
90 95 100

TCC GTT TCT TTA ACA ATT GCA GAA TCG AGC ATG GAT TTT GAT TAT CTC 388
Ser Val Ser Leu Thr Ile Ala Glu Ser Ser Met Asp Phe Asp Tyr Leu
105 110 115

GCC GGA GAT CAT CAG AGG GAT TCT TAT AAA TTC AAC GAT TTG ATT CCG 436
Ala Gly Asp His Gln Arg Asp Ser Tyr Lys Phe Asn Asp Leu Ile Pro
120 125 130

CAG CTG CCA GAA CCG ATT GTA ACC TCC GGC GAC GAA GTA TTA CCA CTT 484
Gln Leu Pro Glu Pro Ile Val Thr Ser Gly Asp Glu Val Leu Pro Leu
135 140 145 150

TTT GCT TTA CAG GTG ACG GTG TTC TCC AAC ACC GGT ATA TGC ATT GGA 532
Phe Ala Leu Gln Val Thr Val Phe Ser Asn Thr Gly Ile Cys Ile Gly
155 160 165

CGC AAT CTT CAT CAA GTT CTT GGT GAT GCC AGT TCT TTT CTG CAT TTT 580
Arg Asn Leu His Gln Val Leu Gly Asp Ala Ser Ser Phe Leu His Phe
170 175 180

AAT AAA TTA TGG GTT TTG GTT GAC AAA TCC AAT GGA GAT TCA TTA AAG 628
Asn Lys Leu Trp Val Leu Val Asp Lys Ser Asn Gly Asp Ser Leu Lys
185 190 195

TTC CTT CCA CTT TCT TCT CTA CCT ATG TAC GAC AGA TCT GTG GTG CAA 676
Phe Leu Pro Leu Ser Ser Leu Pro Met Tyr Asp Arg Ser Val Val Gln
200 205 210

GAT CCA TTT CAT ATT CGT CGA AAA ATC TAC AAT GAA AGA AAA CTG CTC 724
Asp Pro Phe His Ile Arg Arg Lys Ile Tyr Asn Glu Arg Lys Leu Leu
215 220 225 230

AAA TCT CAG GGC ACA CCT ACT GTT CTA AAT CCA GCA ATT TCT AAA GAT 772
Lys Ser Gln Gly Thr Pro Thr Val Leu Asn Pro Ala Ile Ser Lys Asp
235 240 245

GAA GTT CGA GCC ACC TTC ATC CTA CAC CCT ATT GAT ATC ATG AAG CTC 820
Glu Val Arg Ala Thr Phe Ile Leu His Pro Ile Asp Ile Met Lys Leu
250 255 260

AAG AAA TTC ATT TCG TCA AAA AAT CGC AAC TTA ACC GGT AGT AGT AAT 868
Lys Lys Phe Ile Ser Ser Lys Asn Arg Asn Leu Thr Gly Ser Ser Asn
265 270 275

TAT AAT CTG TCA ACT TTC ACG GTG ACA TCT GCA CTG ATC TGG ACA TGC 916
Tyr Asn Leu Ser Thr Phe Thr Val Thr Ser Ala Leu Ile Trp Thr Cys
280 285 290

TTG TCG AAA TCA TTA GAC ACC GTC GTA AGA GAG AAG GTG GAA GAG GAT 964
Leu Ser Lys Ser Leu Asp Thr Val Val Arg Glu Lys Val Glu Glu Asp
295 300 305 310

AAA CAT GCA GCA AAC TTA TGT GCT TTC ATC AAC TGC CGA CAA CGT TTT 1012
Lys His Ala Ala Asn Leu Cys Ala Phe Ile Asn Cys Arg Gln Arg Phe
315 320 325

GCT CCG CCG ATA CCT CAA AAT TAC TTT GGA AAT TGC ATA GTG CCT TGT 1060
Ala Pro Pro Ile Pro Gln Asn Tyr Phe Gly Asn Cys Ile Val Pro Cys
330 335 340

ATG GTG GGA TCG ACT CAT GAG CAA CTT GTA GGA AAT GAA GGG TTG TCG 1108
Met Val Gly Ser Thr His Glu Gln Leu Val Gly Asn Glu Gly Leu Ser
345 350 355

GTA GCT GCA ACC GCC ATC GGA GAT GCT ATC CAT AAG AGG TTA CAT GAC 1156
Val Ala Ala Thr Ala Ile Gly Asp Ala Ile His Lys Arg Leu His Asp
360 365 370

TAC GAA GGA ATT CTG AGA GGA GAT TGG ATA TCG CCG CCC CGA TCA ACA 1204
Tyr Glu Gly Ile Leu Arg Gly Asp Trp Ile Ser Pro Pro Arg Ser Thr
375 380 385 390

TCT GCG GCA CCA AGG TCG ACG CTC ATT TAT GTC GTT GGA TCC GCA CAA 1252
Ser Ala Ala Pro Arg Ser Thr Leu Ile Tyr Val Val Gly Ser Ala Gln
395 400 405

CGC AAT GTG CAT GAT TTT GAT GCA GAT TTT GGT TGG GGA AAG CTT GAA 1300
Arg Asn Val His Asp Phe Asp Ala Asp Phe Gly Trp Gly Lys Leu Glu
410 415 420

AAG CAT GAA TCT GTT TCA ACT AAT CCT TCG GCA ACA CTA ATT TTG ATC 1348
Lys His Glu Ser Val Ser Thr Asn Pro Ser Ala Thr Leu Ile Leu Ile
425 430 435

TCT CGG TCC AGA AGA TTT AAA GGA GCA CTT GAG CTT GGC ATT TCT TTG 1396
Ser Arg Ser Arg Arg Phe Lys Gly Ala Leu Glu Leu Gly Ile Ser Leu
440 445 450

CCT AAG AAT AGG ATG GAC GCA TTT GCC ACC ATT TTT ACG AAT TTC ATC 1444
Pro Lys Asn Arg Met Asp Ala Phe Ala Thr Ile Phe Thr Asn Phe Ile
455 460 465 470

AAT AGT CTC CAT GTG AGG AGC CCT TTG TAAGAAAAAA GTGGTATCAA 1491
Asn Ser Leu His Val Arg Ser Pro Leu
475

TGTATAAAAA AGACAGACAA GTTATGATGC AACAAATGTT TTAGGAGATT ACAAATCCAT 1551 

GGGAAGATGT ATCAAACTCA TCTCTCTATA TATATATATT CAATTGTTTT AAAAAAAAAA 1611 

AAAAAAAAAA A 1622 



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1605 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Petunia hybrida
(F) TISSUE TYPE: petal

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: cDNA library

(B) CLONE: pPAT48

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 67..1410

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

TGTCGACGAA ATCCATTTCA TTTCCTCTTC TTTCTTGTTT TTCTAATTTC GTCATCATTG 60

TTATCC ATG GCA GGT GAA GTA GCA AAA CAA GAA GTT ACA AAA GTG AAA 108
Met Ala Gly Glu Val Ala Lys Gln Glu Val Thr Lys Val Lys
1 5 10

GTC CTG AAA AAA ACA AAC GTG AAA CCA CAT AAA CCA CTA GGA AAA AAA 156
Val Leu Lys Lys Thr Asn Val Lys Pro His Lys Pro Leu Gly Lys Lys
15 20 25 30

GAG TGT CAA TTG GTA ACA TTT GAT CTT CCT TAC CTA GCT TTC TAT TAC 204
Glu Cys Gln Leu Val Thr Phe Asp Leu Pro Tyr Leu Ala Phe Tyr Tyr
35 40 45

AAC CAA AAA TTT CTC ATC TAT AAA GGT GCT GAA AAC TTT GAC GAG ACG 252
Asn Gln Lys Phe Leu Ile Tyr Lys Gly Ala Glu Asn Phe Asp Glu Thr
50 55 60

GTG GAA AAA ATT AAA GAT GGA CTG GCC TTA GTA TTG GTG GAT TTC TAT 300
Val Glu Lys Ile Lys Asp Gly Leu Ala Leu Val Leu Val Asp Phe Tyr
65 70 75

CAA CTA GCT GGG AAA CTT GGA AAA GAT GAA GAA GGG GTT TTC AGG GTG 348
Gln Leu Ala Gly Lys Leu Gly Lys Asp Glu Glu Gly Val Phe Arg Val
80 85 90

GAA TAC GAC GAT GAC ATG GAT GGT GTA GAG GTG ACA GTG GCT GTT GCA 396
Glu Tyr Asp Asp Asp Met Asp Gly Val Glu Val Thr Val Ala Val Ala
95 100 105 110

GAA GAG ATA GAA GTT GCA GAT CTT ACT GAT GAA GAA GGC ACC ACC AAA 444
Glu Glu Ile Glu Val Ala Asp Leu Thr Asp Glu Glu Gly Thr Thr Lys
115 120 125

TTG CAG GAC TTG ATT CCT TGT AAT AAA ATC TTG AAT TTG GAA GGG CTT 492
Leu Gln Asp Leu Ile Pro Cys Asn Lys Ile Leu Asn Leu Glu Gly Leu
130 135 140

CAT CGC CCT CTT CTA GCT GTG CAG CTC ACC AAG CTC AAG GAC GGG CTC 540
His Arg Pro Leu Leu Ala Val Gln Leu Thr Lys Leu Lys Asp Gly Leu
145 150 155

ACC ATG GGA TTA GCA TTT AAC CAT GCT GTG CTG GAT GGT ACT TCG ACG 588
Thr Met Gly Leu Ala Phe Asn His Ala Val Leu Asp Gly Thr Ser Thr
160 165 170

TGG CAC TTT ATG ACC TCG TGG TCC GAG CTT TGC TGT GGG TCC ACC TCA 636
Trp His Phe Met Thr Ser Trp Ser Glu Leu Cys Cys Gly Ser Thr Ser
175 180 185 190

ATT TCT GTC CCA CCA TTC CTT GAA CGA ACC AAG GCT CGT AAC ACT CGA 684
Ile Ser Val Pro Pro Phe Leu Glu Arg Thr Lys Ala Arg Asn Thr Arg
195 200 205

GTC AAG CTC AAC CTC TCT CAA CCA TCA GAT GCA CCC GAA CAT GCT AAG 732
Val Lys Leu Asn Leu Ser Gln Pro Ser Asp Ala Pro Glu His Ala Lys
210 215 220

TCA GCA ACC AAC GGT GAT GTC CCG GCC AAC GTA GAC CCA CCT CTT CGC 780
Ser Ala Thr Asn Gly Asp Val Pro Ala Asn Val Asp Pro Pro Leu Arg
225 230 235

GAA AGA GTA TTC AAG TTC TCC GAG TTA GCA ATT GAC AAA ATC AAG TCA 828
Glu Arg Val Phe Lys Phe Ser Glu Leu Ala Ile Asp Lys Ile Lys Ser
240 245 250

ACA GTC AAT GCC AAC TCA GGA GAG ACG CCA TTC TCC ACA TTC CAA TCA 876
Thr Val Asn Ala Asn Ser Gly Glu Thr Pro Phe Ser Thr Phe Gln Ser
255 260 265 270

CTC TCC GCA CAC GTG TGG CTA GCC GTC ACA CGT GCG CGC CAA CTC AAG 924
Leu Ser Ala His Val Trp Leu Ala Val Thr Arg Ala Arg Gln Leu Lys
275 280 285

CCC GAG GAC TAC ACT GTG TAC ACT GTG TTT GCT GAT TGC AGG AAA AGG 972
Pro Glu Asp Tyr Thr Val Tyr Thr Val Phe Ala Asp Cys Arg Lys Arg
290 295 300

GTT GAT CCT CCA ATG CCA GAA AGT TAC TTC GGC AAC CTA ATT CAG GCA 1020
Val Asp Pro Pro Met Pro Glu Ser Tyr Phe Gly Asn Leu Ile Gln Ala
305 310 315

ATT TTC ACA GTG ACC GCG GCA GGT TTG TTA CTA GCA AGC CCG ATC GAG 1068
Ile Phe Thr Val Thr Ala Ala Gly Leu Leu Leu Ala Ser Pro Ile Glu
320 325 330

TTC GCT GGT GGG ATG ATA CAA CAA GCG ATC GTG AAG CAT GAC GCT AAG 1116
Phe Ala Gly Gly Met Ile Gln Gln Ala Ile Val Lys His Asp Ala Lys
335 340 345 350

GCC ATT GAT GAA AGA AAC AAG GAG TGG GAG AGC AAC CCG AAG ATC TTT 1164
Ala Ile Asp Glu Arg Asn Lys Glu Trp Glu Ser Asn Pro Lys Ile Phe
355 360 365

CAG TAC AAA GAT GCT GGA GTG AAC TGT GTT GCT GTT GGA AGT TCG CCA 1212
Gln Tyr Lys Asp Ala Gly Val Asn Cys Val Ala Val Gly Ser Ser Pro
370 375 380

AGG TTC AAG GTT TAC GAC GTG GAT TTT GGA TGG GGA AAG CCA GAG AGT 1260
Arg Phe Lys Val Tyr Asp Val Asp Phe Gly Trp Gly Lys Pro Glu Ser
385 390 395

GTG AGG AGT GGT TCG AAC AAT AGG TTT GAT GGA ATG GTG TAT TTG TAC 1308
Val Arg Ser Gly Ser Asn Asn Arg Phe Asp Gly Met Val Tyr Leu Tyr
400 405 410

CAA GGC AAA AAT GGA GGA AGA AGC ATT GAT GTG GAG ATT AGT TTG GAA 1356
Gln Gly Lys Asn Gly Gly Arg Ser Ile Asp Val Glu Ile Ser Leu Glu
415 420 425 430

GCA AAT GCT ATG GAG AGG TTG GAG AAA GAT AAA GAG TTC CTC ATG GAA 1404
Ala Asn Ala Met Glu Arg Leu Glu Lys Asp Lys Glu Phe Leu Met Glu
435 440 445

ACT GCT TAATTTGCTT AGCTTGGACT CAACTGGCTA CACTTTATTT ATGAGCTGCT 1460
Thr Ala

ATGACTCACA TGCATGTATG TTTATTTTTT TTGGAGGGGT TCTTTCCTTT TATTGTTTTC 1520

TATGTTTTTT CTTTCTTGTA CGTTATGAAG AGAAACCGAG TATAAAGGAA TAATGTTTTC 1580

AGTTATTAAA AAAAAAAAAA AAAAA 1605



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1479 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Perilla ocimoides
(F) TISSUE TYPE: leaf

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: cDNA library

(B) CLONE: pSAT208

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..1340

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:



CC GTG ATC GAA ACG TGT AGA GTT GGG CCG CCG CCG GAC TCG GTG GCG 47
Val Ile Glu Thr Cys Arg Val Gly Pro Pro Pro Asp Ser Val Ala
1 5 10 15

GAG CAA TCG GTG CCG CTC ACA TTC TTC GAC ATG ACG TGG CTG CAT TTT 95
Glu Gln Ser Val Pro Leu Thr Phe Phe Asp Met Thr Trp Leu His Phe
20 25 30

CAT CCC ATG CTT CAG CTC CTC TTC TAC GAA TTC CCT TGT TCC AAG CAA 143
His Pro Met Leu Gln Leu Leu Phe Tyr Glu Phe Pro Cys Ser Lys Gln
35 40 45

CAT TTT TCA GAA TCC ATC GTT CCA AAA CTC AAA CAA TCT CTC TCT AAA 191
His Phe Ser Glu Ser Ile Val Pro Lys Leu Lys Gln Ser Leu Ser Lys
50 55 60

ACT CTC ATA CAC TTC TTC CCT CTC TCA TGC AAT TTA ATC TAC CCT TCA 239
Thr Leu Ile His Phe Phe Pro Leu Ser Cys Asn Leu Ile Tyr Pro Ser
65 70 75

TCC CCG GAG AAA ATG CCG GAG TTT CGG TAT CTA TCC GGG GAC TCG GTT 287
Ser Pro Glu Lys Met Pro Glu Phe Arg Tyr Leu Ser Gly Asp Ser Val
80 85 90 95

TCT TTC ACC ATC GCA GAA TCT AGC GAC GAC TTC GAT GAT CTC GTC GGA 335
Ser Phe Thr Ile Ala Glu Ser Ser Asp Asp Phe Asp Asp Leu Val Gly
100 105 110

AAT CGT CCA GAA TCT CCC GTT AGG CTC TAC AAC TTT GTC CCT AAA TTG 383
Asn Arg Pro Glu Ser Pro Val Arg Leu Tyr Asn Phe Val Pro Lys Leu
115 120 125

CCG CCC ATT GTC GAA GAA TCC GAT AGA AAA CTC TTC CAA GTT TTC GCC 431
Pro Pro Ile Val Glu Glu Ser Asp Arg Lys Leu Phe Gln Val Phe Ala
130 135 140

GTG CAG GTG ACT CTT TTC CCA GGC CGA GGC GTC GGT ATT GGA ATA GCA 479
Val Gln Val Thr Leu Phe Pro Gly Arg Gly Val Gly Ile Gly Ile Ala
145 150 155

ACG CAT CAC ACC GTT AGC GAC GCC CCG TCG TTT CTC GCG TTT ATA ACG 527
Thr His His Thr Val Ser Asp Ala Pro Ser Phe Leu Ala Phe Ile Thr
160 165 170 175

GCT TGG TCT TCA ATG AGC AAA CAC ATT GAA AAT GAA GAT GAA GAT GAA 575
Ala Trp Ser Ser Met Ser Lys His Ile Glu Asn Glu Asp Glu Asp Glu
180 185 190

GAA TTT AAA TCT TTG CCA GTT TTC GAT AGA TCC GTC ATA AAA TAT CCG 623
Glu Phe Lys Ser Leu Pro Val Phe Asp Arg Ser Val Ile Lys Tyr Pro
195 200 205

ACG AAA TTT GAC TCC ATT TAT TGG AGA AAC GCG CTA AAA TTT CCT TTG 671
Thr Lys Phe Asp Ser Ile Tyr Trp Arg Asn Ala Leu Lys Phe Pro Leu
210 215 220

CAA TCT CGT CAT CCC TCA TTA CCG ACG GAC CGC ATT CGA ACC ACG TTC 719
Gln Ser Arg His Pro Ser Leu Pro Thr Asp Arg Ile Arg Thr Thr Phe
225 230 235

GTT TTC ACC CAA TCC AAA ATT AAG AAA TTG AAG GGT TGG ATT CAG TCC 767
Val Phe Thr Gln Ser Lys Ile Lys Lys Leu Lys Gly Trp Ile Gln Ser
240 245 250 255

AGA GTT CCA AGT TTA GTC CAT CTC TCA TCT TTT GTA GCG ATT GCA GCT 815
Arg Val Pro Ser Leu Val His Leu Ser Ser Phe Val Ala Ile Ala Ala
260 265 270

TAT ATG TGG GCT GGC ATA ACG AAA TCA TTC ACA GCA GAT GAA GAC CAA 863
Tyr Met Trp Ala Gly Ile Thr Lys Ser Phe Thr Ala Asp Glu Asp Gln
275 280 285

GAC AAC GAG GAT GCA TTT TTC TTG ATT CCG GTC GAT CTA AGG CCA CGA 911
Asp Asn Glu Asp Ala Phe Phe Leu Ile Pro Val Asp Leu Arg Pro Arg
290 295 300

TTA GAT CCG CCG GTT CCT GAA AAT TAC TTC GGG AAC TGC TTA TCG TAC 959
Leu Asp Pro Pro Val Pro Glu Asn Tyr Phe Gly Asn Cys Leu Ser Tyr
305 310 315

GCG CTG CCG AGA ATG CGG CGG CGA GAG CTG GTG GGA GAG AAA GGG GTG 1007
Ala Leu Pro Arg Met Arg Arg Arg Glu Leu Val Gly Glu Lys Gly Val
320 325 330 335

TTT CTG GCA GCT GAG GTA ATC GCG GCG GAG ATA AAA AAA AGG ATC AAC 1055
Phe Leu Ala Ala Glu Val Ile Ala Ala Glu Ile Lys Lys Arg Ile Asn
340 345 350

GAC AAG AGA ATA TTA GAA ACG GTG GAG AAA TGG TCG CCG GAG ATT CGT 1103
Asp Lys Arg Ile Leu Glu Thr Val Glu Lys Trp Ser Pro Glu Ile Arg
355 360 365

AAA GCG TTG CAG AAA TCA TAT TTT TCG GTG GCA GGA TCG AGC AAG CTA 1151
Lys Ala Leu Gln Lys Ser Tyr Phe Ser Val Ala Gly Ser Ser Lys Leu
370 375 380

GAT CTT TAC GGT GCA GAT TTT GGA TGG GGG AAG GCG AGA AAG CAA GAA 1199
Asp Leu Tyr Gly Ala Asp Phe Gly Trp Gly Lys Ala Arg Lys Gln Glu
385 390 395

ATA TTG TCG ATT GAT GGG GAG AAA TAT GCA ATG ACG CTT TGT AAA GCC 1247
Ile Leu Ser Ile Asp Gly Glu Lys Tyr Ala Met Thr Leu Cys Lys Ala
400 405 410 415

AGG GAT TTC GAA GGA GGA TTG GAG GTT TGC TTG TCT TTG CCT AAG GAC 1295
Arg Asp Phe Glu Gly Gly Leu Glu Val Cys Leu Ser Leu Pro Lys Asp
420 425 430

AAA ATG GAT GCT TTT GCT GCT TAT TTT TCA CTG GGA ATT AAT GGT 1340
Lys Met Asp Ala Phe Ala Ala Tyr Phe Ser Leu Gly Ile Asn Gly
435 440 445

TAATAAATGT ATGTAATTAA ACTAATATTA TTATGTAACA ATTAATTAAG TGTTGAGTAA 1400

CGTGAAGAAT AATCCCTATT ATATATTTAT GATTTGGTTC AAATAAAGTG TAAAGCCTCT 1460

TGAAAAAAAA AAAAAAAAA 1479



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1508 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Senecio cruentus
(F) TISSUE TYPE: petal

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: cDNA library

(B) CLONE: pCAT8

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 3..1364

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

TG AAC ATT CTC GAA CAT GCC CGA ATA TCG GCC CCC TCG GGC ACC ATC 47
Asn Ile Leu Glu His Ala Arg Ile Ser Ala Pro Ser Gly Thr Ile
1 5 10 15

GGC CAT CGC TCG TTA TCT CTT ACT TTC TTC GAC ATT ACT TGG CTA CTC 95
Gly His Arg Ser Leu Ser Leu Thr Phe Phe Asp Ile Thr Trp Leu Leu
20 25 30

TTC CCT CCG GTC CAC CAT CTT TTC TTC TAT GAC TTT CCA CAT TCT AAA 143
Phe Pro Pro Val His His Leu Phe Phe Tyr Asp Phe Pro His Ser Lys
35 40 45

TCC CAT TTC ATG GAC ACT ATT GTT CCC AGG CTA AAA CAA TCT TTA TCG 191
Ser His Phe Met Asp Thr Ile Val Pro Arg Leu Lys Gln Ser Leu Ser
50 55 60

GTC ACT CTT CAA CAT TTT TTC CCG TTT GCT AGT AAT TTG ATT GTA TTT 239
Val Thr Leu Gln His Phe Phe Pro Phe Ala Ser Asn Leu Ile Val Phe
65 70 75

CCT AAC ACT GAT GGT TCG GGT TTT AAT AAA AAA CCA GAA ATA AAA CAC 287
Pro Asn Thr Asp Gly Ser Gly Phe Asn Lys Lys Pro Glu Ile Lys His
80 85 90 95

GTT GAA GGT GAT TCT GTT GTG GTT ACT TTT GCA GAA TGT TGT CTT GAC 335
Val Glu Gly Asp Ser Val Val Val Thr Phe Ala Glu Cys Cys Leu Asp
100 105 110

TTT AAT AAT TTG ACA GGA AAT CAT CCT CGA AAA TGT GAA AAC TTT TAT 383
Phe Asn Asn Leu Thr Gly Asn His Pro Arg Lys Cys Glu Asn Phe Tyr
115 120 125

CCA CTT GTA CCT TCA TTG GGA AAT GCA ATC AAA TTA TGT GAT TGC GTC 431
Pro Leu Val Pro Ser Leu Gly Asn Ala Ile Lys Leu Cys Asp Cys Val
130 135 140

ACG GTC CCA CTT TTT TCA CTT CAA GTG ACG TTT TTT CCG GGC TCG GGT 479
Thr Val Pro Leu Phe Ser Leu Gln Val Thr Phe Phe Pro Gly Ser Gly
145 150 155

ATA TCA CTA GGA ATG ACG AAT CAT CAT AGC CTT GGT GAC GCT AGC ACG 527
Ile Ser Leu Gly Met Thr Asn His His Ser Leu Gly Asp Ala Ser Thr
160 165 170 175

CGG TTC AAC TTT TTG AAA GGG TGG ACT TCG ATT ATT CAA TCT GGT GTA 575
Arg Phe Asn Phe Leu Lys Gly Trp Thr Ser Ile Ile Gln Ser Gly Val
180 185 190

GAT CGG TCT TTT TTA ACG AAA GGA TCT CCA CCG GTT TTT GAT AGA TTG 623
Asp Arg Ser Phe Leu Thr Lys Gly Ser Pro Pro Val Phe Asp Arg Leu
195 200 205

ATT AAC ATC CCA CAT TTA GAT GAA AAT AAG TTG AGA CAT ACA AGG CTC 671
Ile Asn Ile Pro His Leu Asp Glu Asn Lys Leu Arg His Thr Arg Leu
210 215 220

GAA AGT TTT TAT AAA CCT TCG AGC CTT GTT GGT CCC ACT GAT AAA GTT 719
Glu Ser Phe Tyr Lys Pro Ser Ser Leu Val Gly Pro Thr Asp Lys Val
225 230 235

CGG TCA ACG TTT GTG TTG ACC CGA ACT AAT ATC AAT CTA CTA AAG AAA 767
Arg Ser Thr Phe Val Leu Thr Arg Thr Asn Ile Asn Leu Leu Lys Lys
240 245 250 255

AAG GTC TTA ACC CAA GTG CCA AAC TTG GAG TAC ATG TCA TCT TTT ACG 815
Lys Val Leu Thr Gln Val Pro Asn Leu Glu Tyr Met Ser Ser Phe Thr
260 265 270

GTA ACT TGT GGT TAT ATA TGG AGT TGC ATA GCG AAA TCA CTC GTA AAA 863
Val Thr Cys Gly Tyr Ile Trp Ser Cys Ile Ala Lys Ser Leu Val Lys
275 280 285

ATA GGA GAA AGA AAG GGC GAA GAC GAG TTA GAA CAG TTC ATA ATC ACC 911
Ile Gly Glu Arg Lys Gly Glu Asp Glu Leu Glu Gln Phe Ile Ile Thr
290 295 300



ATT GAT TGT CGA TCT CGT CTT GAT CCA CCA ATT CCC ACA GCC TAC TTT 959
Ile Asp Cys Arg Ser Arg Leu Asp Pro Pro Ile Pro Thr Ala Tyr Phe
305 310 315



GGT AAC TGT GGT GCA CCA TGT GTC CCG ACC TTA AAA AAT GTC GTT TTG 1007
Gly Asn Cys Gly Ala Pro Cys Val Pro Thr Leu Lys Asn Val Val Leu
320 325 330 335

ACT ACG GAA AAT GGG TAT GCA CTT GGT GCT AAA GTA ATT GGA GAG TCT 1055
Thr Thr Glu Asn Gly Tyr Ala Leu Gly Ala Lys Val Ile Gly Glu Ser
340 345 350

ATA TGC AAA ATG ATA TAT AAT AAG GAC GGA ATC TTG AAA GAT GCC GCG 1103
Ile Cys Lys Met Ile Tyr Asn Lys Asp Gly Ile Leu Lys Asp Ala Ala
355 360 365

AGA TGG CAT GAA CCT TTC ATG ATC CCG GCT AGG AAG ATT GGT GTT GCT 1151
Arg Trp His Glu Pro Phe Met Ile Pro Ala Arg Lys Ile Gly Val Ala
370 375 380

GGT ACA CCT AAG CTC AAC TTG TAC GAC TTT GAT TTT GGG TGG GGG AAG 1199
Gly Thr Pro Lys Leu Asn Leu Tyr Asp Phe Asp Phe Gly Trp Gly Lys
385 390 395

CGC ATA AAG TAT GAG ACT GTT TCA ATA GAC TAT AAT ACG TCG ATT TCT 1247
Arg Ile Lys Tyr Glu Thr Val Ser Ile Asp Tyr Asn Thr Ser Ile Ser
400 405 410 415

ATA AAT GCA AGC AAA ACA TCA GCA CAA GAT CTT GAA ATT GGA TTG AGT 1295
Ile Asn Ala Ser Lys Thr Ser Ala Gln Asp Leu Glu Ile Gly Leu Ser
420 425 430

CTA CCG AGT ATG CAA ATG GAG GCG TTT TCT AGC ATC TTT GAT GAA GGA 1343
Leu Pro Ser Met Gln Met Glu Ala Phe Ser Ser Ile Phe Asp Glu Gly
435 440 445

TTA GAG AGT CAA GTT TCA TTG TAGATCATCG TCCCCTTTTT GTGTGCATCA 1394
Leu Glu Ser Gln Val Ser Leu
450

AGTTTCTGTC GTTTTTATGA GTTGCCACTG TTCTATTCTT TAAGTATACC TTTCGACTAT 1454

GTTTTGAAGA TGCAACGATA TAAAATGAAA AAAAAAAAAA AAAAAAAAAA AAAA 1508



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1522 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Lavandula angustifolia
(F) TISSUE TYPE: petal

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: cDNA library

(B) CLONE: pLAT21

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1352

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:



NTG ACC ACC CTC CTC GAA TCC TCC CGA GTG GCG CCG CCT CCA GGC ACG 48
Xaa Thr Thr Leu Leu Glu Ser Ser Arg Val Ala Pro Pro Pro Gly Thr
1 5 10 15

GTG GCT GAG CAG TCA CTC CCG CTC ACC TTC TTC GAC ATG ACG TGG CTG 96
Val Ala Glu Gln Ser Leu Pro Leu Thr Phe Phe Asp Met Thr Trp Leu
20 25 30

CAT TTC CAC CCC ATG CTT CAG CTT CTC TTC TAC GAA CTC CCC TGT TCC 144
His Phe His Pro Met Leu Gln Leu Leu Phe Tyr Glu Leu Pro Cys Ser
35 40 45

AAA CCC GCC TTC CTC GAA ACC GTC GTT CCG AAA CTC AAA CAA TCC TTA 192
Lys Pro Ala Phe Leu Glu Thr Val Val Pro Lys Leu Lys Gln Ser Leu
50 55 60

TCT CTA ACC CTC AAA CAC TTC TTC CCC CTT TCA TGC AAT CTA ATC TAC 240
Ser Leu Thr Leu Lys His Phe Phe Pro Leu Ser Cys Asn Leu Ile Tyr
65 70 75 80

CCT CTA TCG CCG GAG AAA ATG CCG GAG TTC CGG TAT CAG AAC GGT GAC 288
Pro Leu Ser Pro Glu Lys Met Pro Glu Phe Arg Tyr Gln Asn Gly Asp
85 90 95

TCG GTT TCT TTC ACG ATT ATG GAG TCT GTC GGA GAT CAT CCG CAT TCC 336
Ser Val Ser Phe Thr Ile Met Glu Ser Val Gly Asp His Pro His Ser
100 105 110

GCT CAT AAA TAC TAC TGC TTT GCC CCT AGC GAC GAT TAT GAA GAT CTC 384
Ala His Lys Tyr Tyr Cys Phe Ala Pro Ser Asp Asp Tyr Glu Asp Leu
115 120 125

CAG CTG CCG CCG ATA GTC GAG GAA TCT GAT CGG AAA TTG TTT CAA GTT 432
Gln Leu Pro Pro Ile Val Glu Glu Ser Asp Arg Lys Leu Phe Gln Val
130 135 140

TTA GCC GTG CAA GTG ACT CTG TTT CCC GGT CGC GGG GTG TGC ATC GGA 480
Leu Ala Val Gln Val Thr Leu Phe Pro Gly Arg Gly Val Cys Ile Gly
145 150 155 160

ATA ACG ACG CAC CAC ACC GTT AGC GAT GCT CCA TCG TTT GTA GGG TTT 528
Ile Thr Thr His His Thr Val Ser Asp Ala Pro Ser Phe Val Gly Phe
165 170 175

ATG AAG AGT TGG GCT TCC ATC ACT AAA TTC GGA GGA GAT GAT GAA TTC 576
Met Lys Ser Trp Ala Ser Ile Thr Lys Phe Gly Gly Asp Asp Glu Phe
180 185 190

TTG GAC GGA AAA GGT GAA TGT TTG CCG GTT TTC GAC CGA TCG CTC GTG 624
Leu Asp Gly Lys Gly Glu Cys Leu Pro Val Phe Asp Arg Ser Leu Val
195 200 205

AAT TAT CCG CCT AAA TTG GAC ACA TAT TTA TGG AAC AAC GCG CAG AAA 672
Asn Tyr Pro Pro Lys Leu Asp Thr Tyr Leu Trp Asn Asn Ala Gln Lys
210 215 220

CGT CCG TTG GAA TCG CAG CAT CCA TCT TTA CCG ACG GAT CGG ATT CGA 720
Arg Pro Leu Glu Ser Gln His Pro Ser Leu Pro Thr Asp Arg Ile Arg
225 230 235 240

GCT ACC TAC CTT TTC ACC CAA TCT GAA ATT AAG AAA TTG AAG GGT TTG 768
Ala Thr Tyr Leu Phe Thr Gln Ser Glu Ile Lys Lys Leu Lys Gly Leu
245 250 255

ATT CAG AGA AAA GCC CCA AAT GTA GTT AAT CTC TCT TCC TTC GTC GCG 816
Ile Gln Arg Lys Ala Pro Asn Val Val Asn Leu Ser Ser Phe Val Ala
260 265 270

ATC GCA GCT TAT ATC TGG ACC GGC ATC GCC AAA TCG GTC GGA GAT TAC 864
Ile Ala Ala Tyr Ile Trp Thr Gly Ile Ala Lys Ser Val Gly Asp Tyr
275 280 285

AAA GAC GTG GAT GAC GAC AAA CGC GCT TTC TTT TTA ATT CCG ATC GAT 912
Lys Asp Val Asp Asp Asp Lys Arg Ala Phe Phe Leu Ile Pro Ile Asp
290 295 300

TTA AGG CCG CGT TTG GAT CCG CCG GCT CCG GGG AAC TAC TTC GGA AAC 960
Leu Arg Pro Arg Leu Asp Pro Pro Ala Pro Gly Asn Tyr Phe Gly Asn
305 310 315 320

TGT CTA TCG TTT GCG ATG GCG AAG ATC CTG CGG CGG GAT TTG GTC GGA 1008
Cys Leu Ser Phe Ala Met Ala Lys Ile Leu Arg Arg Asp Leu Val Gly
325 330 335

GAT GAA GGG GTG TTT CGG GCA GCT GAG GCG ATC GCG GCG GAA ATA GAG 1056 
Asp Glu Gly Val Phe Arg Ala Ala Glu Ala lie Ala Ala Glu lie Glu 
340 345 350 

AAG AGG ACG AGC GAC AAG AAG ATT CTA GAA ACT GTG GAG AAC TGG CCG 1104 
Lys Arg Thr Ser Asp Lys Lys lie Leu Glu Thr Val Glu Asn Trp Pro 
355 360 365 

TCT GAG ATT CGC GAA GCC TTG CAA AAC TGT TAT TTC TCG GTG GCG GGA 1152 
Ser Glu Ile Arg Glu Ala Leu Gln Asn Cys Tyr Phe Ser Val Ala Gly
370 375 380 

TCG AGC AGG CTT GAT CTT TAC GGC GCG GAT TTT GGA TGG GGT AAG GCG 1200
Ser Ser Arg Leu Asp Leu Tyr Gly Ala Asp Phe Gly Trp Gly Lys Ala
385 390 395 400 

GTG AAG CAA GAG ATA CTG TCG ATT GAT GGA GAG AAG TTT ACG ATG TCG 1248
Val Lys Gln Glu Ile Leu Ser Ile Asp Gly Glu Lys Phe Thr Met Ser
405 410 415 

TTG TGT AAA CCG AGG GAT GCT GCC GGA GGA TTG GAG GTT GGA TTG TCT 1296
Leu Cys Lys Pro Arg Asp Ala Ala Gly Gly Leu Glu Val Gly Leu Ser
420 425 430 

TTG CCA AAG GAG GAA TTG CAA GCT TTT GAT GAT TAT TTT GCG GAG GGA 1344 
Leu Pro Lys Glu Glu Leu Gln Ala Phe Asp Asp Tyr Phe Ala Glu Gly
435 440 445 

ATA AAG GGT TGATTAATCA TTTAATCATG TATTATGAAG TTGGATGAAA 1393
Ile Lys Gly
450

TCCTCTGTTT CATCTCTATT GTTTAAACAA TAATTTTTTT CCATTGAACT TTTTTGAGTC 1453

AATAAAAAAA AAAAAAAAAA AAAAAAAATG AAAAAACTCA GTTATTTTTT TTTTTTTTTT 1513

TTTTTTTTT 1522



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Arg Phe Leu Gly Ile Thr Gly Ser Pro Lys
1 5 10



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Ile His Met Asp Ala Phe Ala Lys
1 5 



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Gly Val Glu Ile Gly Val Ser Leu Pro Lys
1 5 10



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ala Ser Leu Ser Leu Thr Leu Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

His Tyr Val Pro Leu Ser Gly Asn Leu Leu Met Pro Ile Lys
1 5 10



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Val Arg Ala Thr Tyr Val Leu Ser Leu Ala Glu Ile Gln Lys
1 5 10



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Ile His Met Asp Ala Phe Ala Lys
1 5 



(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Lys Ile His Met Asp Ala Phe Ala Lys
1 5 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Lys Ile His Met Asp Ala Phe Ala
1 5 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
AARATHCAYA TGGAYGCNTT YGC 23

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
PTPHAniTTTT TTTTTTTTTT TTT 23



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTCACCATGG AGCAAATCCA AATGGT 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGAGTCGCCC TCATCAC 



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
AACAGCTATG ACCATG 16 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Asp Phe Gly Trp Gly Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GAYTTYGGNT GGGGNAA 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
TGGCAACTGT CTTGCGTCAT G 



(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
CCATGTCAGG TGTGAGGTTC AAC 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
ATCGTTTCGC ATGATTGAAC 



(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
TCAGAAGAAC TCGTCAAGAA 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 12..53

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

GGGATCCAAC A ATG GAG CAA ATC CAA ATG GTG GCC GTG ATC GAA ACG TGT 50
Met Glu Gln Ile Gln Met Val Ala Val Ile Glu Thr Cys
1 5 10

AGA 53
Arg



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTAAAACGAC GGCCAT 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 12..45

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

GGGATCCAAC A ATG GAG CAA ATC CAA ATG GTG AAC ATT CTC GAA C 45
Met Glu Gln Ile Gln Met Val Asn Ile Leu Glu
15 20 25 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
CTCGGAGGAA TTCGGCACGA C 21



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 18..35

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:



AGTCGGATCC AACAATG ACC ACC CTC CTC GAA TCC 

Thr Thr Leu Leu Glu Ser 
1 5



